Skip to main content

55
UPLOADS


Media Type
24
collections
10
audio
8
movies
5
texts
3
data
3
images
More right-solid
Year
3
2018
17
2017
6
2016
2
2015
10
2014
6
2013
More right-solid
Topics & Subjects
14
digital preservation
13
archives
13
web archiving
12
libraries
6
APIs
6
congress
More right-solid
Collection
More right-solid
Creator
10
metropolitan new york library council and audiovisual preservation solutions
7
jefferson bailey
3
stanford university libraries
2
ed summers
1
ia web archiving group
1
jefferson bailey, herbert van de sompel
More right-solid
Language
30
English
1
Estonian
SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
WASAPI Project - Web Archiving Systems APIs
Dec 3, 2018 Jefferson Bailey
texts
eye 3
favorite 0
comment 0
Year 2 performance report for the WASAPI project, IMLS grant LG-71-15-0174-15, "Systems Interoperability and Collaborative Development for Web Archiving.
Topics: web archiving, digital preservation, APIs
Web Archive Domain Files
data
eye 14
favorite 0
comment 0
dotnl-2016-present-domains-in-wayback-domain+year-of-last-capture
Topics: domains, web archive, lists
Web Archive Domain Files
collection
1
ITEMS
39
VIEWS
Apr 18, 2018 IA Web Archiving Group
collection
eye 39
Random set of domain lists extracted from the IA web archive, usually ccTLD lists, often at the request of partners, especially National Libraries. File name will (hopefully) describe file contents. Posted "as is" so you get what you get.
Topics: web archive, domains, zone file
The Early .gov Web Domain
collection
6
ITEMS
163
VIEWS
Dec 15, 2017
collection
eye 163
A special research collection of the entire .gov web domain from 1996-2001 as represented within IA's web archive.
Topics: government, web, research
Web Data Services
collection
605
ITEMS
363,351
VIEWS
Dec 15, 2017
collection
eye 363,351
Datasets, special collections, and other derived and extracted subsets of web data culled from IA's web archive. Many of these datasets were created in relation to specific partnerships and collaborative projects supporting computational research and data mining using web archives.
Topics: research, dataset, web
web_locrl
collection
1,224
ITEMS
16
VIEWS
Oct 20, 2017
collection
eye 16
Files related to general web crawls.
Topic: web
Corporation Websites Collection
collection
597
ITEMS
582,799
VIEWS
Sep 21, 2017
collection
eye 582,799
This collection contains an extracted web archive corpus of 0.8+ million corporate websites (from an original list of ~0.98 websites) extracted from the archive.org web archive, covering the period 1996 to early 2017. This corpus was originally created as a collaboration between the Internet Archive and a group at Dartmouth University, but it may be useful to other researchers. Updated or more detailed information may exist at:...
Topics: websites, corporations, homepages
End Of Term 2016 UNT Crawls
collection
1,275
ITEMS
3.1M
VIEWS
Aug 28, 2017
collection
eye 3.1M
End of Term 2016 Web Archive government web crawls by project partner the University of North Texas.
Topics: end of term, federal government, 2016, president, congress, university of north texas
Web Archive Reading Rooms
Jun 22, 2017 Jefferson Bailey
image
eye 54
favorite 0
comment 0
Viewing the first capture of https://www.kb.nl/ homepage as found in the onsite-only-access web archive of the Koninklijke Bibliotheek (the National Library of the Netherlands) in the Hague, Netherlands in June 2017. (Technically via the in-building network from the desk of Kees Teszelszky. Thank you Kees!)
Topics: web archiving, open access, digital preservation
Web Archive Reading Rooms
Jun 22, 2017 Jefferson Bailey
image
eye 22
favorite 0
comment 0
Viewing a 1990s-era archived archive.org webpage as found in the onsite-only-access web archive of the Bibliothèque nationale de France François-Mitterrand Library in Paris, France in June 2017.
Topics: web archiving, open access, digital preservation
Web Archive Reading Rooms
collection
2
ITEMS
85
VIEWS
Jun 22, 2017 Jefferson Bailey
collection
eye 85
Promoting online access to web archives by posting, online, pictures of me looking at reading-room-access-only web archives. 
Topics: web archiving, open access, digital preservation
Jun 10, 2017 Jefferson Bailey
collection
eye 200
Materials from the National Symposium on Web Archiving Interoperability, part of the WASAPI project, held February 21-22, 2017 at Internet Archive, San Francisco, CA. Additional, web-based presentations: https://wayback.archive-it.org/9135/20170712185836/http://labs.rhizome.org/presentations/wasapi-symposium-2017.html#/ https://wayback.archive-it.org/9135/20170712185836/http://www.gregwiedeman.com/presentations/slides/wasapi.html#/
Topics: web archiving, wasapi, digital preservation
WASAPI Project - Web Archiving Systems APIs
Jun 8, 2017 Stanford University Libraries
movies
eye 31
favorite 0
comment 0
Video demonstration of work on utilities built by Stanford University Libraries using the WASAPI data transfer API.
Topics: web archiving, digital preservation, APIs
WASAPI Project - Web Archiving Systems APIs
May 27, 2017 Stanford University Libraries
movies
eye 36
favorite 0
comment 0
Video demonstration of work on utilities built by Stanford University Libraries using the WASAPI data transfer API.
Topics: web archiving, APIs, digital preservation
WASAPI Project - Web Archiving Systems APIs
May 27, 2017 Stanford University Libraries
movies
eye 39
favorite 0
comment 0
Video demonstration of work on utilities built by Stanford University Libraries using the WASAPI data transfer API.
Topics: web archiving, APIs, digital preservation
WASAPI Project - Web Archiving Systems APIs
May 26, 2017 Jefferson Bailey
texts
eye 93
favorite 0
comment 0
Year 1 performance report for the WASAPI project, IMLS grant LG-71-15-0174-15, "Systems Interoperability and Collaborative Development for Web Archiving."
Topics: web archiving, digital preservation, APIs
WASAPI Project - Web Archiving Systems APIs
collection
13
ITEMS
462
VIEWS
May 26, 2017
collection
eye 462
Collection of materials related to the WASAPI project, an IMLS-funded initiative on " Systems Interoperability and Collaborative Development for Web Archiving ." Code and other materials can also be found at the project's Github page . 
Topics: web archiving, digital preservation, APIs
End Of Term 2016 Library of Congress Crawls
collection
3,892
ITEMS
1.8M
VIEWS
May 14, 2017
collection
eye 1.8M
End of Term 2016 Web Archive government web crawls by project partner the Library of Congress.
Topics: end of term, federal government, 2016, president, congress, library of congress, web, data, library...
SanFranciscoBayGuardianCrawl
collection
1
ITEMS
3,839
VIEWS
Apr 12, 2017
collection
eye 3,839
Crawl of SFBG done at the request of the paper in 2014.
Topics: news, san francisco
Government Web & Data Archive
collection
6,269
ITEMS
3.6M
VIEWS
Apr 6, 2017
collection
eye 3.6M
This collaborative project is an extension of the 2016  End of Term  project, intended to document the federal government's web presence by archiving government websites and data. As part of this preservation effort, URLs supplied from partner institutions, as well as nominated by the public, will be crawled regularly to provide an on-going view of federal agencies' web and social media presence. Key partners on this effort are the Environmental Data & Governance...
Topics: government, data, federal, congress
Apr 2, 2017
collection
eye 3,393
The Environmental Data & Governance Initiative (EDGI)  is an international network of academics and non-profits addressing potential threats to federal environmental and energy policy, and to the scientific research infrastructure built to investigate, inform, and enforce. More information is available at  https://envirodatagov.org/
Topics: data, environment, climate, government
NYPL Labs Archive
collection
0
ITEMS
2,470
VIEWS
Apr 2, 2017
collection
eye 2,470
Archive of NYPL Labs.  https://web.archive.org/web/*/https://www.nypl.org/collections/labs
Topics: libraries, labs, digital, innovation
web_domain_tests
collection
4,061
ITEMS
8M
VIEWS
Mar 28, 2017
collection
eye 8M
WARCs from internal crawl testing.
Topics: web, cctld
Obama White House Social Media Archive
collection
8
ITEMS
571
VIEWS
Jan 3, 2017 The Obama White House
collection
eye 571
Collection of the official social media of the Obama White House, made available for public access in January 2017 prior to the end of the Obama administration by the White House Office of Digital Strategy. The collection includes data from Twitter, Tumblr, Vine, Facebook, and YouTube (pending). For more information on the public release of Obama's social media data, see these White House posts: ...
Topics: obama, white house, government data, social media
Community Video
Dec 29, 2016 Vinay Goel
movies
eye 58
favorite 0
comment 0
A timelapse of the whitehouse.gov homepage from Jan 20, 2009 to Dec 19, 2016 using the Internet Archive Wayback Machine and web archive. Created for the White House Social Media Data Hackathon at Internet Archive on Jan 7, 2017.
Topics: whitehouse, obama, timelapse
End of Term 2016 Post-Inauguration Crawls
collection
5,308
ITEMS
1.8M
VIEWS
Dec 15, 2016
collection
eye 1.8M
This collection contains web crawls performed as the post-inauguration crawl for part of the End of Term Web Archive, a collaborative project that aims to preserve the U.S. federal government web presence at each change of administration. Content includes publicly-accessible government websites hosted on .gov, .mil, and relevant non-.gov domains, as well as government social media materials. The web archiving was performed in the Winter of 2016  and Spring of 2017 to capture websites...
Topics: end of term, federal government, 2016, president, congress
End Of Term 2016 Pre-Inauguration Crawls
collection
4,693
ITEMS
4.7M
VIEWS
Dec 15, 2016
collection
eye 4.7M
This collection contains web crawls performed as the pre-inauguration crawl for part of the End of Term Web Archive, a collaborative project that aims to preserve the U.S. federal government web presence at each change of administration. Content includes publicly-accessible government websites hosted on .gov, .mil, and relevant non-.gov domains, as well as government social media materials. The web archiving was performed in the Fall and Winter of 2016 to capture websites prior to the January...
Topics: end of term, federal government, 2016, president, congress
End of Term 2016 Web Crawls
collection
19,084
ITEMS
12.1M
VIEWS
Nov 21, 2016
collection
eye 12.1M
This collection contains web crawls performed as part of the End of Term Web Archive, a collaborative project that aims to preserve the U.S. federal government web presence at each change of administration. Content includes publicly-accessible government websites hosted on .gov, .mil, and relevant non-.gov domains, as well as government social media materials. The web archiving was performed in the Fall and Winter of 2016 and Spring of 2017. For more information, see...
Topics: end of term, federal government, 2016, president, congress, government data
Community Video
Oct 25, 2016 Trevor Thornton
movies
eye 780
favorite 2
comment 1
Data visualization of animated GIFs from the Geocities web archive. The visualization was done by staff at the Hunt Library at North Carolina State University and displayed in their data visualization lab. The project was done in association with the Internet Archive's 20th Anniversary celebrating the history of archiving the web.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: geocities, animated gifs, data visualization
Military Industrial Powerpoint Complex
collection
1,272
ITEMS
367,474
VIEWS
Oct 21, 2016 United States Military
collection
eye 367,474
This collection was a special project originally done as part of the Internet Archive's 20th Anniversary celebration on October 26, 2016 highlighting IA's web archive. The collection consists of all the Powerpoint files (57,489) from the .mil web domain that were crawled from the public web (with no special login or credentials) by the Internet Archive and partners from 1996-2017. The original release in October 2016 featured 48,110 Powerpoint files. Another 9,379 unique new Powerpoint files,...
Topics: military, industry, powerpoint, complex
SM feed test
Jul 15, 2016
web
eye 25
favorite 0
comment 0
Community Video
Jun 15, 2016
movies
eye 73
favorite 0
comment 0
Web Archive sculpture donated from Internet Archive to Library of Congress in 1998. Video from 2016, noting it's semi-functional status and location in staff offices in the Adams building. 
Topics: wayback, web archives, art, sculpture
Flickr Commons Archive
May 11, 2016
image
eye 36
favorite 0
comment 0
cuba flag
Topic: cuba
Estonian Web Domain
collection
0
ITEMS
23
VIEWS
Jan 31, 2016 Jefferson Bailey
collection
eye 23
Overview of the historical Estonian web domain (.ee).
Topics: web, estonia
Cuban Web Domain Crawl
collection
385
ITEMS
1.3M
VIEWS
Dec 29, 2015
collection
eye 1.3M
This collection is a snapshot of the Cuban Web Domain (.cu) from early 2016.
IMLS Museum Universe Data File Crawl
collection
2,876
ITEMS
14.9M
VIEWS
Dec 19, 2015
collection
eye 14.9M
2015 crawl of museum websites listed in the IMLS Museum Universe Data File. More about the IMLS MUDF can be found at https://www.imls.gov/research-evaluation/data-collection/museum-universe-data-file
Topic: AIT
Internet Archive Presents
Nov 3, 2015 Jefferson Bailey, Maria LaCalle
texts
eye 106
favorite 0
comment 0
Presentation by Jefferson Bailey and Maria LaCalle at the Preservation Metadata Interest Group at the ALA 2015 conference in San Francisco (Frisco) California.
Topics: web archiving, metadata, digital preservation, archives, libraries
Community Texts
Jun 9, 2015
texts
eye 25
favorite 0
comment 0
Lat-Lon CSV for ARS workshops
Topic: data
Community Video
Feb 13, 2015 Jefferson Bailey, Herbert Van de Sompel
movies
eye 47
favorite 0
comment 0
Slides and audio from Jefferson Bailey, Program Manager, Internet Archive and  Herbert Van de Sompel, Digital Library Research & Prototyping, Los Alamos National Laboratory as part of the "Strategies I" panel from the Georgetown Law Library symposium "404/File Not Found: Link Rot, Legal Citation and Projects to Preserve Precedent" on October 24, 2014 in Washington, D.C. Uploaded by me, Jefferson, so we could embed it in a blog post -- not as some weird vanity thing! I...
Topics: web archiving, digital preservation, archives, link rot, libraries
Community Video
Dec 17, 2014 jefferson_bailey
movies
eye 302
favorite 0
comment 0
A mash-up supercut of all the ducking and covering in the Federal Civil Defense film "Duck and Cover" ( https://archive.org/details/gov.ntis.ava11109vnb1 ) scored with "Silent Night" by @nullsleep of 8bitpeoples from "The 8bits of Christmas" ( https://archive.org/details/8bp038 ). Made for holiday season 2014 -- donate to Internet Archive! https://archive.org/donate/
Topics: ducking, covering, chiptune, holiday, nuclear apocalypse
Ferguson Tweets
Nov 26, 2014 Ed Summers
data
eye 41
favorite 0
comment 0
417,972 URLs and corresponding tweet IDs that tweeted that URL, all in a fat old .tsv extracted from Ed Summers' collection of 13,480,000 tweet IDs that mentioned 'ferguson' from 2014-08-10 22:44:43 to 2014-08-27 15:15:50 that is here: https://archive.org/details/ferguson-tweet-ids. Thanks to Ed Summers for doing this!
Topics: twitter, ferguson
Ferguson Tweets
Nov 25, 2014 Ed Summers
data
eye 68
favorite 0
comment 0
417,972 URLs extracted from Ed Summers' collection of 13,480,000 tweet IDs that mentioned 'ferguson' from 2014-08-10 22:44:43 to 2014-08-27 15:15:50 listed here: https://archive.org/details/ferguson-tweet-ids
Topics: twitter, ferguson
More Podcast, Less Process
collection
10
ITEMS
1,095
VIEWS
Sep 22, 2014
collection
eye 1,095
More Podcast, Less Process
Topic: listmania
More Podcast, Less Process
Jun 9, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 126
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information: ...
Topics: archives, libraries, digitization, preservation, performing arts
More Podcast, Less Process
May 6, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 69
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information:...
Topics: archives, preservation, podcast, history, electronic records, formats, records management
More Podcast, Less Process
Mar 31, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 96
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information: ...
Topics: archives, video, media, film, preservation, conservation, libraries
More Podcast, Less Process
Feb 24, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 233
favorite 1
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information:...
Topics: archives, libraries, web archiving, digital preservation, columbia, new york art resources...
More Podcast, Less Process
Jan 30, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 73
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information: ...
Topics: archives, libraries, special collections, research, genealogy
More Podcast, Less Process
Dec 23, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 74
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More information:...
Topics: archives, libraries, special collections, film preservation, research
More Podcast, Less Process
Dec 9, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 95
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. Episode 004 features archivist,...
Topics: archives, libraries, special collections, preservation, conservation
More Podcast, Less Process
Nov 11, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 89
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. Episode 003 features  Janet Bunde,...
Topics: archives, libraries, special collections, education, research
More Podcast, Less Process
Oct 22, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 108
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. Episode 002 features Grace Lile and...
Topics: archives, libraries, special collections, preservation, conservation, activism, advocacy
More Podcast, Less Process
Oct 7, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 177
favorite 1
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. Episode 001 features Mark Matienzo,...
Topics: archives, libraries, special collections, museums, preservation, conservation, digital...
Community Texts
Aug 11, 2013 Joseph Cuvelier, L. Stainer
texts
eye 1,343
favorite 1
comment 0
Proceedings of the International Congress of Archivists and Librarians held in Brussels in 1910. Multiple languages, mostly English, French, and Dutch. Table of contents is at end page 479/484 of pdf (pg 804 in the original work).
Topics: archives, libraries